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(54) Method of blindly detecting a transport format of an incident convolutional encoded signal, 
and corresponding convolutional code decoder 



(57) The actual transport format is ex ante unknown 
and belonging to a set of MF predetermined reference 
transport formats. The signal comprises a data block 
having an ex ante unknown number of bits correspond- 
ing to said ex ante unknown transport format, and a cy- 
clic redundancy check (CRC) field containing a trans- 
mitted CRC word. The method comprises decoding said 
signal using a Maximum-a-Posteriori algorithm, said de- 
coding step including decoding said signal considering 
respectively the MF possible reference formats and re- 
spectively delivering MF corresponding groups of soft 
output information, calculating from each group of soft 
output information a calculated CRC word and compar- 
ing said calculated CRC word with said transmitted CRC 
word, selecting all the groups for which said calculated 
CRC word is equal to said transmitted CRC word, and 
selecting the actual transport format of the encoded sig- 
nal from at least one soft output information among the 
last ones of each selected group. 
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Description 

[0001] The invention relates in general to channel coding and decoding techniques, especially convolutional codes, 
and more particularly to the blind transport format detection (BTFD). 
5 [0002] An application of the invention is directed in general to the field of wireless communication systems, and more 
particularly to the CDMA systems such as the different CDMA based mobile radio systems like CDMA 2000, WCDMA 
(Wide Band CDMA) or the IS-95 standard. 

[0003] The third generation mobile radio system specifies Convolutional codes as well as Turbo-codes as channel 
coding techniques [3GPP, Technical Specification Group Radio Access Network; Multiplexing and channel coding 
10 (FDD) ; (3G TS 25.212 version 3.5.0(2000-12)), Release 1999]. 

[0004] The UMTS standard defines, beside the TFC I (Transport Format Combination Indicator) base transport format 
detection, a possibility of blindly detect the transport format in use. More details can be found in 3GPP, Technical 
Specification Group Radio Access Network; Multiplexing and channel coding (FDD); (3G TS 25.212 version 4.3.0 
(2001-12)), Release 4. 

15 [0005] This option is used in order to reduce the TFCI symbol overhead in the transmission frame and thus to increase 
the air load. Further, as indicated in the UMTS standard, the explicitly blind detected transport channels have to be 
coded using a convolutional code. 

[0006] Generally speaking, the blind transport format detection is based in the use of the CRC (Cyclic Redundancy 
Check) words. An example of such detection can be found in the annex of the above mentioned 3GPP document. The 

20 main idea is to use a Viterbi decoder for decoding the different possible block sizes corresponding to the possible 
transport formats which can be used in the transport channel under investigation. Then, a CRC word (Cyclic Redun- 
dancy Check word), has to be checked for all possible block sizes. A correct CRC determines the correct block size. 
For large CRC length, the CRC check is sufficient to determine the correct block. However, in the case of smaller 
CRCs, like 8 or 12 bits, an additional metric has to be deployed in order to distinguish between two or more blocks 

25 with a correct CRC. And, when a Viterbi decoder is used, an additional calculation is needed for calculating this addi- 
tional metric. 

[0007] The invention intends to provide a solution to this problem and proposes a different method of blindly detecting 
a transport format of an incident convolutional encoded signal. 

[0008] Generally speaking, said transport format being ex ante unknown and belonging to a set of MF predetermined 
30 reference transport formats, said signal comprising a data block having an ex ante unknown number of bits corre- 
sponding to said ex ante unknown transport format, and a Cyclic Redundancy Check (CRC) field, containing a trans- 
mitted CRC word, the method according to the invention comprises decoding said signal using a Maximum-a-Posteriori 
algorithm, said decoding step including decoding said signal considering respectively the MF possible reference for- 
mats and respectively delivering MF corresponding groups of soft output information, calculating from each group of 
35 soft output information a calculated CRC word and comparing said calculated CRC word with said transmitted CRC 
word, selecting all the groups for which said calculated CRC word is equal to said transmitted CRC word, and selecting 
the actual transport format of the encoded signal from at least one soft output information among the last ones of each 
selected group. 

[0009] Thus, the invention uses already calculated soft output information (LLR information) as additional metric to 
40 distinguish between two or more blocks with a correct CRC. This is possible since the proposed architecture uses a 
MAP base decoder for decoding the CC code. In this case, no additional metric calculation is needed like in the case 
of the Viterbi decoder. 

[0010] According to an embodiment of the invention, the actual transport format of the encoded signal is selected 
from the last soft output information of each selected group. 
45 [001 1 ] The actual transport format is for example the reference format having the greatest last soft output information. 
[0012] However, other possibilities exist for selecting the actual transport format. For example, we can use the last 
but one soft output information, or the last but one and the last soft output information. Another possibility consists in 
combining one of the last soft output information with the minimum soft output information. 

[0013] According to an embodiment of the invention, said decoding step comprises calculating state metrics, and all 
50 the data blocks are decoded in parallel window-by-window on sliding windows having a predetermined size, and at 
least some of the states metrics calculated on a window are valid for all the data blocks. 

[0014] According to a preferred embodiment of the invention, said decoding step comprises for each window calcu- 
lating forward state metrics during a forward recursion, performing a backward acquisition having a predetermined 
acquisition length, calculating backward state metrics during a backward recursion and calculating soft output infor- 
55 mation in the reverse order, and for each window only one forward recursion is performed which is valid for all the 
transport formats. 

[0015] According to an embodiment of the invention, the first window processing including forward recursion, back- 
ward acquisition, backward recursion and soft output calculation, is completely valid for all data blocks having a size 
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larger than the sum of the window size and the acquisition length. 

[0016] In other words, according to particular embodiments of the invention, there is a complete reuse of forward 
state metric calculation for all different block sizes and partial reuse (if possible) of backward state metric calculation 
and LLR calculation for different block sizes. 
5 [0017] The invention proposes also a convolutional code decoder, comprising 

input means for receiving an convolutional encoded signal having an ex ante unknown transport format belonging 
to a set of MF predetermined reference transport formats, said signal comprising a data block having an ex ante 
unknown number of bits corresponding to said ex ante unknown transport format, and a Cyclic Redundancy Check 

10 (CRC) field containing a transmitted CRC word, 

convolutional code decoding means implementing a Maximum -a- Posteriori algorithm for decoding successively 
said signal by considering respectively successively the MF possible reference formats and comprising a Log- 
Likelihood-Ratio unit for successively respectively delivering MF corresponding groups of soft output information, 
blind transport format detection means comprising a Cyclic Redundancy Check unit for calculating from each group 

15 of soft output information a calculated CRC word and comparison means for comparing said calculated CRC word 

with said transmitted CRC word, first selection means for selecting all the groups for which said calculated CRC 
word is equal to said transmitted CRC word, and second selection means for selecting the actual transport format 
of the encoded signal from at least one soft output information among the last ones of each selected group. 

20 [0018] According to an embodiment of the invention, said second selection means are adapted to select the actual 
transport format of the encoded signal from the last soft output information of each selected group. 
[0019] According to an embodiment of the invention, said convolutional code decoding means are adapted to cal- 
culate state metrics, all the data blocks are decoded in parallel window-by-window on sliding windows having a pre- 
determined size, and at least some of the state metrics calculated on a window are valid for all the data blocks. 

25 [0020] According to an embodiment of the invention, said convolutional code decoding means are adapted for each 
window, to calculate forward state metrics during a forward recursion, to perform a backward acquisition having a 
predetermined acquisition length, to calculate backward state metrics during a backward recursion, and for each win- 
dow only one forward recursion is performed which is valid for all the transport formats. 

[0021] According to an embodiment of the invention, the first window processing including forward recursion, back- 
30 ward acquisition, backward recursion and soft output calculation, is completely valid for all data blocks having a size 
larger than the sum of the window size and the acquisition length. 

[0022] The code decoder according to the invention can be a combined Turbo-code/convolutional code decoder 
further comprising Turbo-code decoding means for performing Turbo-code decoding. 

[0023] And, according to an embodiment of the invention, said Turbo-code and convolutional code decoding means 
35 comprises common processing means having a first configuration dedicated to Turbo-code decoding and a second 
configuration dedicated to convolutional code decoding. And said decoder further comprises : 

metrics memory means for storing state metrics associated to the states of a first trellis and delivered by said 
processing means in its first configuration, 
40 - input/output memory means for storing input and output data delivered to and by said processing means in its 
second configuration, 

adaptable memory means for storing input and output data delivered to and by said processing means in its first 
configuration, and for storing state metrics associated to the states of a second trellis and delivered by said process- 
ing means in its second configuration, 
45 - control means for configurating said common processing means in its first or second configuration depending on 
the kind of code, and 

memory control means for addressing differently said adaptable memory means depending on the configuration 
of said common processing means. 

so [0024] Said common processing means may implement a Maximum-A posteriori (MAP) algorithm. 

[0025] And, the MAP algorithm implemented is for example a so-called LogMAP algorithm or a so-called MaxLogMAP 
algorithm. 

[0026] The convolutional code decoder according to the invention may be advantageously realized by an integrated 
circuit. 

55 [0027] The invention proposes also a terminal of a wireless communication system including a decoder as defined 
above. 

[0028] This terminal may form a cellular phone or a base station. 

[0029] Other advantages and features of the invention will appear on examining the detailed description of embod- 
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iments, these being in no way limiting and of the appended drawings in which : 

figure 1 shows the structure of the two different NSC used in UMTS for Convolutional encoding, 
figure 2 shows a part of a trellis which represents the possible transitions in one time step, 
5 - figure 3 shows a receiving chain of mobile phone including a decoder according to the invention, 

figure 4 shows very diagrammatically the internal structure of a first embodiment of a decoder according to the 
invention, 

figure 5 shows more in detail a part of a decoder according to the invention, 

figure 6 illustrates diagrammatically possible transport formats, 
10 - figure 7 shows diagrammatically a part of blind detection means according to the invention, 

figure 8 shows diagrammatically a CRC processing scheme according to the invention, 

figure 9 shows diagrammatically a windowing scheme according to the invention, 

figure 1 0 illustrates a flow chart of a blind detection according to the invention, 

figure 11 shows an UMTS Turbo-code encoder, 
15 - figure 1 2 shows a generic Turbo decoder, 

figure 13 shows very diagrammatically the internal structure of a decoder according to a second embodiment of 

the invention, 

figure 14 shows more in detail a part of the decoder illustrated in figure 13, 

figure 15 shows more in detail an adaptable memory belonging to a decoder according to the invention, 
20 - figure 16 shows diagrammatically an ACS unit architecture, 

figure 1 7 shows an LLR unit belonging to a combined decoder according to the invention, 
figure 1 8 shows global control steps for Turbo-code decoding according to the invention. 

1 Encoding 

25 

1.1 General Considerations and Convolutional encoding 

[0030] Convolutional encoding is performed by calculating the modulo-2 sum of the input values of the current and/ 
or selected previous time steps. Implementation therefore is straightforward and mainly consists of shift register and 
30 a couple of exclusive-OR gates. Through the way those are switched, different kinds of convolutional codes can be 
realized: 

Systematic Codes: One of the output streams is equal to the input stream, the systematic information. 
Non-Systematic Codes (NSC): Each output is a parity information. Parity information is produced by taking the 
35 modulo-2 sum of shift register entries stating the history of the encoding process. 

Recursive Codes: A special parity signal is produced and fed back in conjunction with the systematic input. 
Non-Recursive Codes: No such feedback loop exists. 

[0031] An instance of a convolutional encoder is defined by a combination of these properties, the memory depth 
40 (constraint length) and the logical functions used to produce the parity information. These properties are described 
through generator polynomials. 

[0032] Figure 1 represents the structure of two different NSC used in UMTS for Convolution and Convolutional en- 
coding. Furthermore, two different rates have to be considered. The rate 1/2 convolutional encoder (a) has two outputs, 
whereas the rate 1/3 encoder (b) has three outputs. 
45 [0033] With the value of the current time-step available, M = K-1 flip-flops are needed to store the encoder history. 
This gives rise to an interpretation of the encoder as finite-state machine (FSM). It shows a behavior equivalent to a 
Mealy-automaton. The likelihood of a transition between two states is the key to the decoding of convolutional codes. 

1.2 Code-Trellis 

50 

[0034] A code trellis is the unrolled state chart of a finite-state machine. The number of states the encoder can be 
in (N) is a function of the constraint length K: 



N = 2K-1. 

[0035] Depending on the nature of the code (RSC, NSC,...) only certain transitions are possible. A trellis is used to 
depict those transitions. In figure 2, a part of a trellis is shown, that represents the possible transitions in one time-step. 
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Instead of the usual tree-structure used to display state-charts, the trellis combines states which are equivalent. Solid 
lines in figure 2 stand for transitions due to the input of a systematic bit of "0" whereas dashed lines represent those 
caused by a "1". 

[0036] On convention, every encoding process starts in the all-zero state. 

5 

1 .3 Trellis-termination 

[0037] For the considered codes the initial state of the trellis is always known to be the all-zero state. Without taking 
any precautions, the encoder ends in an arbitrary state, leaving no hint where to start the backward recursion. This 
10 can be counteracted by driving the encoder into a defined final state. Reaching the final state (e.g. the all-zero state) 
can be achieved by appending a sequence, which steers the encoder towards the final state as fast as possible. This 
sequence is also depending on the state the encoder is in after the last information bit has been coded. The length of 
this sequence is equal to K-1 ; the transmitted bits are called tailbits. 

15 2. Decoding 

[0038] 2.1 Decoding convolutional codes is keeping track of the transitions that took place in the encoder. From 
those the input symbols which have been sent are deducted. Due to the degradations caused by the channel, only 
estimates of the systematic and parity bits are available, which will both be called channel values here. There are two 
20 different kinds of outputs: 



[0039] Based on the channel values, probabilities can be computed that certain combinations of systematic and 
parity bit occurred. From this and considering the encoder history, the probability that the encoder was in a given state 
at a given time-step can be computed. 

[0040] Two approaches exist to deal with those state-probabilities. The maximum likelihood based Viterbi algorithm 
30 uses them to search the most likely code-word. For this it traverses the trellis from the all-zero state to the end state 
and looks for the most likely sequence. The states chosen for the survivor path indicate the most likely sequence of 
symbols that has been send. Hence a Viterbi Decoder is a sequence estimator. 

[0041] The maximum-cc-posteriori (MAP) algorithm on the other side estimates the probability that the encoder was 
in the given state and that the current state leads to the final state given the remainder of the channel values. This can 
35 be efficient computed by a forward and backward recursion over the trellis. Afterwards, for each bit the probabilities 
for those states associated with a systematic "0" are added and compared to those associated with a "1 ". The symbol 
with the higher probability is assumed to be the sent one. As this works on bit rather than on sequence level, it is called 
symbol estimation. 

40 2.2 The maximum-A-Posteriori (MAP) Algorithm 

[0042] The name Maximum-A-Posteriori stems from the fact that the estimation of the bits is based on the whole 
receiver sequence. It is done after all the information is in. 
[0043] Equation 2.1 shows the output of such a MAP decoder. 
45 [0044] Bahl et al. described in [L. Bahl, J. Cocke, F. Jelinek, and J. Raviv. Optimal Decoding of Linear Codes for 
Minimizing Symbol Error Rate. IEEE Transaction on Information Theory, IT-20:284-287, march 1974] an efficient algo- 
rithm for the MAP decoder, which is based on recursions operating on the trellis in forward and backward recursion. 
That algorithm is commonly referred to as MAP or BCJR algorithm: 

[0045] Let R k denote the input of the MAP, with R = (R A ,.,.,R k ,,..R N ) where N is the length of the block, then the 
50 BCJR-algorithm computes the a-posteriori probabilities (APP) 



Hard values: they merely indicate if a symbol is supposed to be "1" or "0". 

Soft values: These also deliver a measure for the reliability of the decision (the hard decision is extend by the 
probability that the decision is correct). 



25 



55 





for each data symbol d k after reception of the symbol sequence R. 
[0046] It is computed using two probabilities: One, that the encoder has reached state S m , with 
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after k received symbols: 



a k {m) = J>r{s? | Ro-.R^} ( 2.2) 

m 1 

and another, that the remainder of the input sequence will lead the encoder to the final state given the state S 
at time k+1 : 

P k+1 (m')-l>r{R k ...R N \S?: i } (2.3) 

[0047] For this, the probability of a transition from state to S m has to be known. It is depending on the code 
structure, the channel model and the received symbols R k \ 

= Prfe.S^ \R k } ( 2.4) 

[0048] Using y, a and p can be computed recursively by: 



a k {m') = Ja^Cm)^-,,^') < 2 " 5 > 



m 



P k (m) -^P^W-YiS^S?) ( 2.6) 

m 

[0049] A known start and final state are necessary for the BCJR algorithm to perform optimally. If the trellis is not 
terminated, all states have to be assumed to have equal probability for k = N. 
[0050] The a-posteriori probability itself can be expressed as 



J ^ Y (S^,Sf,d k = l)-« w («) • &(m') 

A(dk) = ln 2%rW_ v sz\d k = oya^m) ■ &0') ( 2 7) 



m m 



[0051] When the constituent encoder is a NSC, this equation can be simplified because the state number can be 
used to identify a state reached by input d k = 1 or d k = 0. 



AW= 2 w -.".(y)-ft(s;) 



[0052] The large number of multiplications involved in the computation of the APP makes it less attractive for imple- 
mentation. Therefore the MAP algorithm has to be transformed to the logarithmic domain, where it becomes the Log- 
M AP algorithm, which increases numerical stability and eases implementation, while not degrading the error correction 



EP 1 471 677 A1 



performance. 

2.3 The MAP algorithm in the logarithm domain : LogMAP 

[0053] The transformation of multiplications into additions is the motivation for defining the MAP algorithm in the Log- 
Domain. A problem is posed by the additions. Using the Jacobian logarithm, the additions are substituted by a new 
operator: 

ln(e 51 + e 52 ) = max* (51 , 52) = max(51 , 52) + ln(1 + e " 161 " 52 ') 
[0054] Similar the negative logarithm can be taken, this leads to 

min* (51,52) = min(51, 52) - ln(1 + e ~ |61 ~ 621 ) . 

[0055] For more than two operands, the max* is applied recursively. Since the operator is associative, a tree-like 
evaluation can be employed, which is advantageous for hardware implementation. The sub-optimal MaxLogMAP al- 
gorithm is obtained by using the approximation 

max* (51, 52) « max(51,52) . 
[0056] Using the max* operation, the recursions become: 



m ~m 



\n(a k (m°)) = max m (Infa^ (m)) + \n(y(S k _, ,S k )), (2.8) 



ln(p fr (m)) = max m (ln(p fc+1 (m)) + ln( Y (S™, S™ +1 )) (2.9) 
[0057] Let ln(a fc (m')) from now on be denoted as a^m) (accordingly for p and y), then the recursions take the form: 

a k (nT) = max' m (a k Mm) + y (SZ V S%) (2.10) 

f k (m) = max* m (p fc+1 (m') + y(S^,S^)- (2.11) 

[0058] Similar we get: 

A(d fr ) = max* mm .(Y(S^ 1> S^;,S^, d k = 1) + a k .,(m) + F k (nT)) 

-max mm .(y(S l k n _ v S k n ,d k = 0) + a k -, (m) + f k (m')) (2.12) 

and in case of an NSC encoder: 

A(d k ) = max; ? J<r k (S:) + £(*;)) (2 n&) 

- max s ^Jcr h (S;) + fcW)) 

[0059] Computation y of includes the estimation of channel values. An optimised branch metric calculation is used. 
In case of an NSC encoder the channel values are parity informations. Dependant on the rate there are only four resp. 
eight different values per k in total the y can take. The code-structure alone determines, which of them is assigned to 
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which transition. After skipping constant factors and making additional algebraic transformations we get (eq. 2.13) 
Rate 1/2: 

AC G1 

- / GO , G1 4t s//c 

y(x k = ^,x k =-l)=_ — 

/v o 

, r- GO 

-, GO , G1 j v 4t s>^ 

Y (x fr = -1,x„ — 

/v 0 



Rate 1/3: 



A - GO . ,- GO 

-.GO , G1 iX 4^s//c 4F s//c 
Y (x, = -1,x, .-1>._^_^ 



-.GO . G1 , G2 . 
Y(*/r =+1^/c = +1 '*/c =+1) = 0 



/i /r G2 

-/GO , G1 , G2 4t s>V 
y(x^ -+1.X* -+1.X* 

/v 0 



A r- G1 

-/GO H G1 H G2 HX 4t s^/f 

Y(x^ --1.x* =+1) = — — 

/v 0 



, c G1 „ c- G2 

-/Go H G1 H G2 u \ y k 4t 
Y(x^ --l.x* =-1) = — — + 

/v o /v 0 



/I zr G0 

-.GO H G1 . G2 4t s-K/c 

Y(x^ --1.x* -+1.X* — 

/v 0 



GO , G1 , G2 



. _ GO . ,- G2 



y(*k =-^ x k = + ^ x k =- 1 )= + w 

. _ GO , G1 

-.GO , G1 , G2 4E s//c 4E s//c 

Y (x^ m-1,x k --1.x* = + 1)=_^— + _^— 

X CT G0 A C G2 

-/GO H G1 H G2 .v 4t s>V 4t s>V 
= " 1 '*/c = " 1 >*/c =- 1 )= + yy- 

[0060] This simplifies the implementation significantly, as only up to four terms have to be computed from the channel 
data. One term can be dropped completely and the last one be computed from two others. The scaling factor — - is 



"o 



multiplied externally by usage of a working point. 
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3. Windowing 

[0061] The MAP algorithm minimizes the probability of bit-errors, basing the decision for each bit on the knowledge 
of the complete block of samples, that is a posteriori. It has been shown, however, that a sliding window technique, 

5 where a window slides in direction of increasing bit positions k, delivers almost the same communication performance 
as the original MAP decoder. Then the decisions are based on a subblock beginning at the first bit position in the 
complete block and ending at the last position in the sliding window. The MAP algorithm can decide all bits belonging 
to the window, less the bits contained in the last portion of that window. Those bits in the last portion are decided when 
the window has moved to its next position(s). If only the bits in the middle part get decoded, then the window does not 

10 even have to slide (that is to move steadily in one direction). 

[0062] When we look at the equations of the MAP algorithm, one can easily identify four subtasks: 

Calculation of the branch metrics (step 1) 

Calculation of the forward state metrics during forward recursion (step 2) 
15 Calculation of the backward state metrics during backward recursion (step 3) 

Calculation of the soft outputs (step 4) 

[0063] The data dependencies between these steps are as follows: both recursions (step 2 and step 3) and the soft 
output calculation (step 4) depend on the branch metrics (step 1 ), and the soft output calculation step (step 4) in addition 

20 depends on the forward and backward state metrics (step 2 and step 3). All branch metrics and soft outputs of the 
current data block can be computed independently from each other. Only the order of computing the forward state 
metrics and the order of computing the backward state metrics are predefined by the direction of the respective recur- 
sion. The sequence of the recursions does not matter, since there is no data dependency between the forward and 
backward state metrics. The backward recursion can be processed before, in parallel to, or after (as implied above) 

25 the forward recursion. Hence, we can introduce the notion of a first and a second recursion. The metrics for the first 
recursion for a certain trellis step have to be stored in memory until the second recursion has produced the missing 
complementary metrics for computing the soft output value connected with that trellis step. Thus the decoder needs 
to store the first recursion metrics of the full data block. Introducing windowing, each comprising a data subblock, 
breaks this dependency. Decoding on a window-by-window basis permits to reduce the required memory size. 

30 [0064] A prerequisite for decoding on windows is the concept of "acquisition". Originally, the forward and the backward 
recursion of a MAP decoder start at one end of the trellis and stop at the opposite end. Upon an appropriately long 
acquisition phase, however, a recursion can start at any trellis step. This applies to both forward and backward recur- 
sions. 

[0065] Consider a forward acquisition commencing at trellis step k-M, where M is the acquisition depth. The forward 
35 state metrics are initialized as: 

(s k _ M ) - o, s k _ M «= {,...a m - 1} (3.D 

40 

[0066] After M recursion steps, the metrics a k (S k ) approach the values that would be obtained by starting the recur- 
sion from the beginning of the trellis. Now consider a backward recursion commencing at trellis step k+M. The backward 
state metrics are initialized as: 

45 

Km(S m )-0, S k+M G{,...,2 m -l} (3.2) 

[0067] In analogy to the forward recursion, after the same number of recursion steps M, the metrics $ k (S k ) approach 
50 the values that would be obtained by starting the recursion at the end of the trellis. If the acquisition depth M is too 
small, then the decoding performance can be severely degraded. With M above a certain value, the decoding perform- 
ance is virtually optimal. The value of M that leads to a reasonable balance between computational effort and decoding 
performance can be determined by simulation. 

[0068] The windowsize itself has no influence on the communication performance, but on the size of the RAM for 
55 storage of the cc-state metrics. Furthermore the throughput is dependant on the ratio between acquisition length and 
windowsize. When only one state metric calculation unit is available, the individual windows are processed sequential. 
It takes one clock cycle for the forward recursion and one clock cycle for the backward recursion/LLR calculation per 
data bit. The additional computational overhead is determined by the total number of acquisition steps. If the windowsize 
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is equal to the acquisition length, the decoding needs one additional clock cycle per bit, resulting in total three clock 
cycles per bit. If the windowsize is much larger than the acquisition length, the computational overhead goes near zero, 
resulting in a throughput of nearly two. So the choice of the windowsize is a trade off between memory size and 
throughput. 

5 

4. First embodiment of a convolutional decoder. 

[0069] 4J_ We refer now to figure 3 which illustrates a decoder according to the invention which is incorporated in 
the reception chain of a cellular mobile phone TP. 
10 [0070] The encoded signal is being received by the antenna ANT and processed by the radio frequency stage ERF 
of the receiver. At the output of the ERF stage, the signal is converted into the digital domain by an A/D converter. The 
digital base band signal is then processed by a "rake" demodulator which is used generally in the case of a CDMA 
system. 

[0071] Then, the channel decoding stage includes a convolutional code decoder CTD according to the invention. 
15 [0072] The decoder CTD according to the invention, as illustrated on figure 4, comprises convolutional code decoding 
means CCDCM for performing convolutional code decoding. 
[0073] The decoding means CCDCM implements a MAP algorithm. 

[0074] Further, input/output memory means referenced as CC I/O RAMs are provided for storing input and output 

data delivered to and by the decoding means CCDCM. 
20 [0075] Further, another memory, referenced CCa-RAM, is used for storing forward state metrics (a-state metrics) 

associated to the states of a trellis (for example 256 states) and delivered by the decoding means CCDCM. 

[0076] At last, the decoder CCD comprises blind transport format detection means BTFD for blindly detecting the 

transport format used in a transport channel under investigation. The internal architecture of these means BTFD will 

be described more in detail thereafter. 
25 [0077] Figure 5 illustrates more in detail internal structure of the convolutional code decoding means CCDCM. 

[0078] These means comprise essentially three major calculation units, i.e. : 

a BM unit that calculates the branch metrics BM and controls the CC I/O RAMs. The calculation of the branch 
metrics is performed according to equation 2.13, depending on the value of the rate, 
30 - the state metric unit SM which calculates state metrics SM according to equations 2.10 and 2.11, 

the LLR unit, which calculates the soft output information (LLRs) in a pipeline manner, according to equations 2. 1 2a. 

[0079] The input RAMs consist in three RAMs referenced as GO-RAM, G1-RAM and G2-RAM according to the 
respective convolutional code input data. Output RAM is a LLR-RAM. 
35 [0080] The blind detection of the transport format will be now described more in detail with reference to figures 6-1 0. 
[0081] In figure 6, the structure of a coded block in the case of BTFD is depicted. 

[0082] The transport channel under consideration has MF different transport formats TF 1 to TF MF and thus MF dif- 
ferent coded blocks sizes. A coded block contains a CRC field (transmitted CRC word) and the data bit field. The CRC 
field in each of the different blocks has the same size and thus uses the same polynomial. In the presented case the 

40 coded block only contains one CRC at the end. The DTX field can be considered as being noise. 

[0083] The blind transport format detection means comprises a Cyclic Redundancy Check Unit CRCU (figure 7) for 
calculating from each group of soft output information (LLR) corresponding to each transport format, a calculated CRC 
word. This calculated CRC word is stored in a CRC register CRG1 . Therefore, certain input parameters are necessary, 
which are the length of the CRC and the CRC polynomials. 

45 [0084] The transmitted CRC word (CRC sum) which is attached to the data block in reverse order during the encoding, 
is stored in the register CRG2. 

[0085] Comparison means are adapted to compare the content of the register CRG1 with the register CRG2. When 
equal, the CRC check is positive. 

[0086] The LLR bits are stored in a buffer BFF of windowsize (for example 53 bits for convolutional code). These 
50 bits are shifted out of the buffer into the register CRG1 during forward recursion and backward acquisition (figure 8). 
[0087] This architecture can be extended to check up to MF=1 6 different block sizes by usage of total 1 6 CRC check 
units. The buffer has to be implemented only once, because the LLRs of the different blocks are calculated serially. 
So only the shaded blocks in figure 7 are multiple instantiated. 

[0088] As mentioned before, a sliding window approach is used. The MAP algorithm minimizes the probability of bit- 
55 errors, basing the decision for each bit on the knowledge of the complete block. Originally the forward and backward 
recursion of a MAP decoder start at one end of the trellis and stop at the opposite end. This starting points are known 
for the forward recursion (state 0) and if tailing is applied as in UMTS, we also know the end point (= starting point for 
backward recursion) of the trellis (state 0). However upon an appropriate long acquisition phase we can start at any 
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trellis step. This applies for both forward and backward recursion. The starting point for the forward recursion is always 
known, because we start the calculation with the alpha recursion. Furthermore the starting points for the following 
window is known, because we already calculated the state metrics and stored them in the alpha RAM. For backward 
recursion we need acquisition to get the appropriate starting points when calculating the backward recursion for the 
5 window. So we normally do the normal (full) acquisition to initialize the backward recursion. This is done until we get 
to the last two windows. These windows are somewhat special: 

last window: We don't do a normal acquisition, because we don't have data to do so. But we don't need this at all, 
because we have the tailbits, which are used to give us the init state metrics for backward recursion. 
10 • last but one window: It could happen, that there are not enough data bits left to do a full acquisition. But this is no 
problem because we simply start with the tail bits and then process the remaining acquisition steps. Than we 
proceed as usual with backward recursion/LLR calculation. 

[0089] It would be possible to adapt this scheme for the blind transport format detection. However, if we consider for 
15 example two transport formats of sizes 108 and 120, then, when we determine the LLRs for block length 108, we 
cannot calculate the LLRs for block length 1 20 without starting the forward recursion from the second window. And the 
initialization values for the forward recursion have to be stored in specific memory, otherwise, we have to start the 
forward recursion from the beginning. 

[0090] When one wants to reuse as much as possible, the calculation scheme has to be preferably changed. The 
20 individual block lengths are not calculated serially, but in parallel. This is shown in figure 9. The actual stored alpha 
values are used for calculating all possible LLRs of this window. This way, we have only one forward recursion for all 
the different transport formats, only the backward recursion/LLR calculation is individual. Furthermore even some back- 
ward recursion/LLR calculations are done only once for more than one block size. So we have a total reuse of all so 
far calculated alpha values and the possibility of partial reuse of calculated beta values and LLRs. As there can be 
25 more than one block length with correct CRC, the absolute values of the last LLR can be used to distinguish between 
two or more blocks with a correct CRC. These values have to be stored additionally. 

[0091] Figure 9 shows the windowing scheme for the already mentioned example for two different block sizes 108 
and 1 20. The processing starts with forward recursion for the first window. After the backward acquisition the backward 
recursion/LLR calculation follows. The forward recursion is only done once, the alpha state metrics are valid for both 
30 block sizes. Because the second block size is larger than the sum (window size+acquisition length), the backward 
recursion/LLR calculation is valid for both blocks. So the first window processing (forward recursion, backward acqui- 
sition, backward recursion/LLR calculation) is completely reused for both block sizes. 

[0092] The next window starts as usual with forward recursion. Because this is the last but one window for the first 
block size (108) in this example, there is no full acquisition but the tail bit processing and a partial acquisition. Afterwards 
35 the backward recursion/LLR calculation for the second window follows. These calculated LLRs are individual for the 
first block size and can't be reused. So for the second block size a separate backward acquisition and backward 
recursion/LLR calculation has to be performed. In our example the acquisition also consists of tail bits and partial 
acquisition. 

[0093] The next window starts with the forward recursion. The forward recursion is always done for the whole data, 
40 so it is either a complete window or for the last window consists of the remaining bits. So in our example it is done for 

the remaining 14 bits. Then the last backward processing for block size 108 starts with the acquisition (done with tail 

bits) and then the backward recursion/LLR calculation. After all LLRs for block size 108 are calculated the results of 

the final CRC is available. The last backward processing for block size 120 is done similar. 

[0094] The proposed general processing flow for BTFD is shown in figure 10. 
45 [0095] In step 100, "shift (i,...i max )" means that the calculated LLR bits are stored in all the CRC registers CRG1 

respectively associated with all the possible formats (i min ...i max ). However, in steps 101 and 102, "shift(i)" means that 

the LLR bits are stored in the CRC register associated to the current format i. 

5. Second embodiment of a convolutional decoder 

50 

[0096] A second embodiment of a decoder according to the invention will be now described. This decoder is a com- 
bined Turbo-code/convolutional code decoder, as for example the one which has been disclosed in European patent 
application n° 0 201 9974.1. 

[0097] The main features of such a combined decoder will be now described. 
55 [0098] However, before describing in detail the internal architecture of such combined decoder, some general con- 
siderations on Turbo-code encoding and decoding will be now described. 

[0099] 5.1 A Turbo code encoder consists of two constituent convolutional encoders and an interleaver. The convo- 
lutional codes are fixed to be the RSC codes of rate 1/2 and generator polynomials (13,15/(octal notation) introduced 
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before. 

[0100] The systematic information of the second encoder is not transmitted because it can be reconstructed (by 
deinterleaving) from the systematic output of the first encoder. By this a rate of R = 1/3 is achieved. Figure 11 shows 
the detailed UMTS Turbo code encoder. The trellis termination leads each encoder into its final state separately. This 
dissolves the dependency between the systematic information of the first and second encoder for the tailbits, because 
these lead each encoder independent from the other by activating the respective switch, see figure 11 . Hence the last 
six bits per encoder (systematic and parity for each) have to be transmitted separately. This results in a total overhead 
of 12 bits per block. 

[0101] Decoding Turbo codes by searching the most likely codeword is far too complex. Therefore iterative decoding 
is advised. The two convolutional codes are decoded separately. While doing this, each decoder incorporates infor- 
mation that has been gathered by the other. This "gathering of information" is the exchange of soft-output values, where 
the bit-estimates of one unit are transformed into a priori information for the next. The decoders hence have to be soft- 
input soft-output (SISO) units. 

[0102] The confidence in the bit estimation is represented as a Log-Likelihood-Ratio (LLR): 

P(d k = 1) 

[0103] The sign shows whether this bit is supposed to be one or zero whereas the confidence in the decision is 
represented by the magnitude. 

[0104] In order to extract the information that has been gathered during the last decoding stage, the systematic and 
a priori information that lead to this estimate have to be subtracted. This yields: 

L\d k ) = \\d k )- y s k - L 2 deint (d k ) 

L 2 (d k ) = A 2 (d k )- y s kM - 0 mx {d k ) 
[0105] This is called the extrinsic information. 

[0106] The confidence of one decoder in a bit to have a certain value biases the initial guess of the other. 
[0107] Figure 1 2 shows such a Turbo code decoder consisting of two MAP decoders, an interleaver and a deinter- 
leaver. Feeding the input of one decoder as a priori information input to the next enables the improvement over the 
decoding iterations. It also gave Turbo codes their name, as it resembles the "feedback-of-exhaust" used in combustion 
turbo engines. Inputs to the decoder are the received channel values (systematic, parityl and parity2); during the very 
first MAPI operation, the a priori information is set to zero. 

[0108] Concerning the MAP algorithm, equations 2.1-2.7 are used for Turbo decoding as well as equations 2.8-2.12 
and equations 3.1-3.2. 

[0109] Computation of y includes the estimation of channel values and the a priori information. Whereas the con- 
ventional method is quite complicated, an optimized branch metric calculation is used. Prior to transmission, every bit 
is subject to a transformation. Let 

x k e {),i} 

denote the (coded) bit, then the transmitted value is 

y k = -2■x k+ ^, 

hence 

y k e £ 1,1}. 

[0110] Thus the actual mapping is T -> '-1' and '0' -> T. 
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[0111] There are only four different values per k in total the y can take, one for every assumption 



The code-structure alone determines, which of them is assigned to which transition. After skipping constant factors 
and making additional algebraic transformations we get : 



f(jc; = +i,< = +i) = o 

4E v 



p 

sS k 



K (2.17) 



f(xl = -l,x: = + l) = ^- + L(d k ) 



N. 



0 



f « = -l, < =-i)= 4££ + M>i + L {d t ) 

[01 12] This simplifies the implementation significantly, as only two terms have to be computed from the channel and 
a priori data. One term can be dropped completely and the last one be computed from the first two. The scaling factor 

— - is multiplied externally by usage of a working point. 

[0113] 5.2 The combined decoder CTD according to this embodiment, as illustrated on figure 13, comprises Turbo- 
code decoding means TCDCM for performing Turbo-code decoding, and convolutional code processing means CCD- 
CM for performing convolutional code. 

[01 14] Turbo-code and convolutional code decoding means comprise common processing means CCPR implement- 
ing MAP algorithm and having a first configuration dedicated to Turbo-code decoding and a second configuration 
dedicated to convolutional code decoding. Said common processing means CCPR or MAP unit form a Soft-in-Soft- 
out unit (SISO unit) on which MAPI and MAP2 operations are done serially for turbo decoding, until a stopping criterion 
is fulfilled. 

[01 1 5] Further to these common processing means, the Turbo-code decoding means TCDM comprises conventional 
interleaving means IL. 

[0116] Moreover, the combined decoder CTD comprises metrics memory means referenced as TC alpha-RAM for 
storing forward state metrics associated to the states of a first trellis (8 states here), said forward state metrics being 
delivered by the processing means CCPR in its first configuration (Turbo decoding). 

[0117] Input/output memory means referenced as CC I/O RAMs are provided for storing input and output data de- 
livered to and by the processing means CCPR in its second configuration, i.e. for CC decoding. 
[01 1 8] At last, adaptable memory means ADMM are used for storing input and output data delivered to and by said 
processing means CCPR in its first configuration (Turbo-code decoding), and for storing forward state metrics asso- 
ciated to the states of a second trellis (256 states here) and delivered by said processing means CCPR in its second 
configuration (convolutional code decoding). 

[0119] Control means CDRLM configurate said common processing means CCPR in its first or second configuration 
depending on the kind of code, and memory control means CTMM address differently said adaptable memory means 
ADMM depending on the configuration of said common processing means CCPR. 

5.3 Architecture of the common processing means CCPR 

[0120] Figure 1 4 illustrates more in detail thus internal structures of the common processing means CCPR. 
[0121] The processing means CCPR comprise essentially three major of calculation unit, i.e., 

the BM unit that calculates the branch metrics BM and controls the complete I/O RAMs and alpha RAM ; 

the state metric unit which calculates 8 state metrics SM in parallel with 8 add-compare-select (ACS) units ; 

the LLR unit, which calculates the LLRs in a pipelined manner. 
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5.4 Input/output memory means CC I/O RAMs for convolutional decoding 

[0122] The input RAMs consist in three RAMs referenced as G0_RAM G1_RAM and G2_RAM according to the 
respective CC input data. Output RAM is CCLLR_RAM. The RAM-sizes are 512*6 bits. 

5 

5.5 Metrics memory means TCoc-RAM for Turbo decoding 

[0123] The alpha state metric RAM for TC decoding is a dedicated 64*88bit RAM (alpha_RAM). 

10 5.6 Adaptable memory means ADMM 

[01 24] The adaptable memory means ADMM is used either for storing input and output data for Turbo-code decoding 
or for storing forward state metrics in the convolutional code decoding. 

[0125] And, as shown on figure 15, these adaptable memory means comprises a lot of elementary memories as well 

15 as an additional memory means Add. 

[0126] More precisely, in the present application, the Turbo-code decoding means are adapted to receive successive 
sequences of N1 symbols (N1 =51 20) of b1 bits (b1 =6). The input and output data delivered to and by the processing 
means CCPR in its first configuration (Turbo decoding) comprises for each received sequence, g different blocs of N1 
words of b1 bits. Here, g=4 and these g blocs are 

20 the systematic input data X, 

the parity input data Y1 , 
the interleaved parity input data Y2 and, 
the decisions of the decoding, as output data. 
[0127] The forward state metrics to be stored in the adaptable memory means (when convolutional code decoding) 

25 is a bloc of N2 words (N2=1 728) of b2 bits, b2 being here greater than b1 . Generally, the product N2 times b2 is equal 
to the product of W (windowsize) with the number of states of the trellis and, with the number of bits for each state. In 
the present case, W is equal to 54 or the convolutional code, 64 for the Turbo-code. The number of states is equal to 
8 for the Turbo-code and to 256 for the convolutional code. And, the number of bits for each state is equal to 11 . 
[0128] Accordingly, N2 is equal to 1728 whereas b2 is equal to 88. 

30 [0129] Thus, as shown on figure 8, the main memory means of the adaptable memory means ADDM comprises 4 
groups of p (p=3) elementary memories respectively dedicated to the g blocs of N1 words. And, each elementary 
memory is adapted to store N2 words of b1 bits. 

[0130] The additional memory Add is adapted to store 1728 words of 16 bits. 

[0131] And, generally speaking, the memory control means address the adaptable memory means ADMM in said 
35 first configuration (Turbo-code decoding) such that each bloc of 5120 words of 6 bits is written in or read from its 
dedicated group of 3 elementary memories. 

[0132] Further, said memory control means address said adaptable memory means ADMM in said second configu- 
ration (convolutional code decoding) such that the twelve elementary words of said forward state metrics are respec- 
tively stored in the twelve elementary memories of the main memory means at the same address, whereas the additional 
40 elementary word of the forward state metrics (the 1 6 other bits) is stored in the additional memory means at the same 
address. 

[0133] In other words, for CC decoding each of the I/O RAMs of the TC is split into 3 separate RAMs. These RAMs 

are the concatenated to form the required bitwidth for the storage of 8 state metrics in parallel. We need 88 bits (8* 11 

bit), the I/O RAMs are 6 bit wide so we get 4*3*6 bit=72 bit. Therefore we need an additional 16bit RAM to form the 
45 CC alpha RAM. This RAM sharing enables us to get a sufficient windowsize for CC decoding. 

[0134] As shown on figure 15, the naming conventions are as follows: 

[0135] The systematic input data is stored in X_RAM1 , X_RAM2, X_RAM3, 
the parity input data is stored in Y1_RAM1 , Y1_RAM2, Y1_RAM3, and 
the interleaved parity input data is stored in Y2_RAM1 , Y2_RAM2, Y2_RAM3. 
50 [0136] The TC output RAMs are LLR_RAM1, LLR_RAM2 and LLR_RAM3. The MSB represents the output hard 

decision, whereas the LSB represent the extrinsic information/LLR soft decision (depending on actual decoding 

progress). This enables a stopping of the decoding after MAPI CRC check. 

[0137] Because the RAMs are split into three, an appropriate address transformation is done to map the address 
space 0-5119 to three times 0-1727. This value is rounded up to the minimum feasible RAM. Only the actual needed 
55 RAMs are activated, all others are deselected. 
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5.7 Calculation of the branch metrics 

[0138] The common processing means comprises a branch metric unit for calculating in each first and second con- 
figurations the branch metrics associated to the branches of the corresponding trellis (see eq. 2.17 for turbo code). 
[0139] The transfer of TC input data samples to the CTD decoder is done in the following order : 

X A , Y\ , ,X 2 , Y\ , Vj , X Bh Y 2 Bj , TailBits 

Bj is the number of bits in the r th code block. 

The storage of the tail bits has to be considered, the transmitted bits for the trellis termination are: 

*K+1, Z K+1> *k+2> z k+2> x k + 3> z k + 3> *K+1> Z W + 1 . x 'k + 2> z 'k + 2> x 'k+3> z 'k + 3> where Y1 = z > Y2 = z ' and x'=XSecond encoder, 
K=block length. 

The parity data RAMs Y1 and Y2 are filled sequentially, the respective three tailbits are just appended. The systematic 
data RAM X is also filled sequentially, because there are six tailbits in total, the tailbits for the first encoder are appended 
first, then the tailbits for the second encoder follow. 

[0140] The input sequence for CC decoding is simpler, because there is only one tail sequence. The tailbits are just 
appended to the respective RAM. 

[0141] The different steps of the windowing scheme can be divided into further sub-steps. The calculation of the 
branch metrics is the prerequisite for the calculation of the state metrics. The memory control mean therefore addresses 
the input RAMs to calculate the respective branch metrics. We use the optimized MAP calculation scheme proposed 
above. The used naming convention is based on the binary representation of the pair [systematic/parity information] 
E{0,1} resp. [G0/G1], [G0/G1/G2]. X, Y, LLR, GO, G1, G2 represent the individual data stored in this RAMs (e.g. the 
content of the LLR-RAM is the extrinsic information). 

Turbo Code: 

branchO = 0 
branchA = Y 
branch2 = X+ LLR 
branch3 = X+ Y+ LLR 

Convolutional Code, rate 1/2: 

branchO = 0 
branchA = G1 
branch2 = GO 
branch3 = G1 + GO 

Convolutional Code, rate 1/3: 

branchO = 0 
brancM = G2 
branch2 = G1 
branch^ = G1 + G2 
branchA = GO 
branch5 = GO + G2 
branch^ = GO + G1 
branch! = GO + G1 + G2 

[0142] The branch metric calculation is very simple, two additions for TC and one resp. four additions (calculation 
of branch7 reuses a previously addition, e.g. G0+G1) for CC. In case of TC we have to use interleaved data during 
MAP2 operation. Therefore the BM unit interacts with the external interleaver to fetch the appropriate addresses. In 
order to avoid data collisions during forward and backward TC recursion, a dedicated LLR-cache is needed as well as 
a dedicated LLR-register. 

5.8 Calculation of the state metrics 

[0143] Turning now to the calculation of the state metrics, the common processing means comprises a configurable 
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state metrics unit SM. 

[0144] As shown on figures 14, 15, and 16, the configurable state metrics units SM comprises 

an architecture of 8 parallel ACS (Add, Compare, Select) units for calculating in each configuration 8 forward state 
metrics, 

auxiliary memory means (AXMM) for temporarily storing the calculated forward state metrics for a recursive cal- 
culation, and 

auxiliary control means for controlling the storage of the metrics in said auxiliary memory means depending on the 
configuration of said common processing means. 

5.9 ACS architecture 

[0145] More precisely, the ACS architecture illustrated on figure 16, calculates eight state metrics in parallel out of 
the branch metrics and the previous state metrics according to equations 2.10 and 2.1 1 . This is done with 8 ACS (Add, 
Compare, Select) units based on a modmin-procedure (the MODMIN blocs perform the min* operator). 
[0146] This ACS architecture is used for forward as well as for backward recursion for both Turbo code and Convo- 
lutional code. Because of the different trellis diagrams, the flexibility is achieved with multiplexers for the incoming 
branch metrics (bmux) and state metrics (smux). These multiplexers are controlled externally as explained thereafter. 
[0147] The eight ACS units need sixteen branch metric + state metric sums, therefore sixteen bmux multiplexers are 
provided. Each of these bmux multiplexers can select among eight branch metrics (because CC rate 1/3 needs eight 
different branch metrics in total). However, because of a particular state metric distribution not sixteen but only twelve 
smux state metric multiplexers are needed. Furthermore this state metric distribution leads to only 2:1 smux multiplex- 
ers, because the setting is valid for either CC forward recursion/TC backward recursion or CC backward/TC forward 
recursion. There is no additional multiplexing of the calculated state metrics. In case of forward recursion, the new 
state metrics are always in ascending order. Figure 1 6 shows the general architecture of the acs architecture. Sm1 -sm8 
denote the 8 parallel input state metrics at one timestep, bm0-bm7 denote the up to 8 different branch metrics. The 
smux setting for TC forward/CC backward is marked bold. The output of this unit are the new state metrics as well as 
the LLR-metrics (pstate metric + branch metric) for the LLR calculation. 

5.10 Control of the multiplexers 

[0148] The multiplexer of the state metrics unit are controlled by a specific machine, not shown for reason of simpli- 
fication. 

[0149] The state metric multiplexer smux controlling is very simple. There are only two different settings, either for 
TC forward recursion/CC backward recursion or for TC backward recursion/CC forward recursion. The controlling of 
the branch metric multiplexer bmux is more complex and done in the FSM. Each sequence (32 steps) depends on the 
chosen rate as well as on the actual operation (forward recursion or backward recursion). 

CC rate 1/2 forward recursion 

There are 4 different sets of bmux settings, denoted A,B,C,D 

The forward sequence is 
AABBCCDDAABBCCDDCCDDAABBCCDDAABB 
CC rate 1/2 backward recursion 

There are 4 different sets of bmux settings, denoted A,B,C,D 

The backward sequence is 
AABBCCDDAABBCCDDBBAADDCCBBAADDCC 
CC rate 1/3 forward recursion 

There are 8 different sets of bmux settings, denoted 
A,B,C,D,E,F,G,H 

The forward sequence is 
ABCDEFGHFEHGBADCHGFEDCBACDABGHEF 
CC rate 1/3 backward recursion 

There are 8 different sets of bmux settings, denoted 
A,B,C,D,E,F,G,H 

The backward sequence is 
ABCDEFGHFEHGBADCDCBAHGFEGHEFCDA 



16 



EP 1 471 677 A1 

5.11 The LLR unit 

[0150] The LLR unit calculates the LLR (see eq. 2.12 for turbo code). This is done in a pipeline for the TC decoder 
consisting of three modmin stages with registers between stage one and stage two see Figure17. Input to the first 

5 stage are the sums alpha state metric from the alpha RAM + LLRsum (= branch metric + beta state metric) from the 
SM unit. This values are also registered, thus resulting in atotal pipeline depth of four. The upper modmin-tree calculates 
the minimum of all states reached by input '1 ' (LLR1 ), the lower one the minimum reached by input '0' (LLRO). 
[0151] Once the LLR calculation starts, new values are present at the inputs every clock cycle. The control is very 
simple and done by a simple shifting of a data valid flag through a 4 stage Flip-Flop pipeline. 

w [0152] Parts of the LLR calculation for TC are reused for Convolutional decoding, see figure 1 7 (not shown are the 
multiplexer for the input values and the adder for calculating the sums). 

[0153] More precisely, all the architecture shown on figure 17 is used for Turbo decoding whereas only the upper 
and lower parts thereof are used for convolutional code decoding. 

[0154] Since the encoding of the Convolutional Code is done with a NSC, the input '0' resp. '1 ' determines the state 
15 number. 

[0155] Therefore we don't need the branch metrics, which simplifies the calculation to only four modmin units in the 
first stage. The upper two modmin units calculate the minimum of the four state metric sums reached by input 'V 
(onestates), the lower two the minimum of input '0' (zerostates). Therefore the inputs are the alpha state metrics from 
the alpha-RAM and the beta metrics from the AXMM memory. 
20 [01 56] Because we don't calculate all 256 states at once we need a loopback to determine the appropriate minimums. 
This is additional hardware compared to the Turbo decoder part and situated after stage 2. The controlling is quite 
simple and realized with a FSM. This can be best expressed when looking at the feedback unit for the onestates. The 
first minimum is stored in the lower register, the second one in the upper register. All following minima are stored in 
the lower register, the resulting minimum always in the upper register. The LLR_valid flag is also generated by the FSM. 

25 

5.12 Global control unit 

[0157] At last, the combined decoder according to the invention comprises a global control unit which controls the 
decoding process on MAP level. Because TC decoding is done in an iterative matter, the number of iterations depends 
30 on the actual decoding status. Therefore after each MAP operation a stopping criterion is checked. This stopping 
criterion can either be the selected total number of half-iterations, a correctly detected CRC-sum (only after MAPI ) or 
an early detection of undecodable blocks based on the mean value criterion. The particular decoding steps for TC are 
shown in figure 1 8. In case of CC decoding only one MAPI operation is needed. 

[0158] Furthermore the global control unit controls the handshake mode. This handshake mode allows step-by-step 
35 execution of the decoding steps and on-demand memory flush. 



Claims 

40 1. Method of blindly detecting a transport format of an incident convolutional encoded signal, said transport format 
being ex ante unknown and belonging to a set of MF predetermined reference transport formats, said signal com- 
prising a data block having an ex ante unknown number of bits corresponding to said ex ante unknown transport 
format, and a cyclic redundancy check (CRC) field containing a transmitted CRC word, said method comprising 
decoding said signal using a Maximum-a-Posteriori algorithm, said decoding step including decoding said signal 

45 considering respectively the MF possible reference formats and respectively delivering MF corresponding groups 

of soft output information, calculating from each group of soft output information a calculated CRC word and com- 
paring said calculated CRC word with said transmitted CRC word, selecting all the groups for which said calculated 
CRC word is equal to said transmitted CRC word, and selecting the actual transport format of the encoded signal 
from at least one soft output information among the last ones of each selected group. 

50 

2. Method according to claim 1 , characterized by the fact that said the actual transport format of the encoded signal 
is selected from the last soft output information of each selected group. 

3. Method according to claim 2, characterized by the fact that the actual transport format is the reference format 
55 having the greatest last soft output information. 

4. Method according to any one of the preceding claims, characterized by the fact that said decoding step comprises 
calculating states metrics, by the fact that all the data blocks are decoded in parallel window-by-window on sliding 
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windows having a predetermined size, and by the fact that at least some of the states metrics calculated on a 
window are valid for all the data blocks. 



5. Method according to claim 4, characterized by the fact that said decoding step comprises for each window cal- 
5 culating forward state metrics during a forward recursion, performing a backward acquisition having a predeter- 

mined acquisition length, calculating backward state metrics during a backward recursion and calculating soft 
output information in the reverse order, by the fact that for each window only one forward recursion is performed 
which is valid for all the transport formats. 



10 6. Method according to claim 5, characterized by the fact that the first window processing including forward recursion, 
backward acquisition, backward recursion and soft output calculation, is completely valid for all data blocks having 
a size larger than the sum of the window size and the acquisition length. 



7. Convolutional code decoder, comprising input means for receiving an convolutional encoded signal having an ex 
15 ante unknown transport format belonging to a set of MF predetermined reference transport formats, said signal 

comprising a data block having an ex ante unknown number of bits corresponding to said ex ante unknown transport 
format, and a cyclic redundancy check (CRC) field containing a transmitted CRC word, convolutional code decoding 
means implementing a Maximum-a-Posteriori algorithm for decoding successively said signal by considering re- 
spectively successively the MF possible reference formats and comprising a Log-Likelihood-Ratio unit for succes- 

20 sively respectively delivering MF corresponding groups of soft output information, blind transport format detection 

means comprising a cyclic redundancy check unit for calculating from each group of soft output information a 
calculated CRC word and comparison means for comparing said calculated CRC word with said transmitted CRC 
word, first selection means for selecting all the groups for which said calculated CRC word is equal to said trans- 
mitted CRC word, and second selection means for selecting the actual transport format of the encoded signal from 

25 at least one soft output information among the last ones of each selected group. 

8. Convolutional code decoder according to claim 7, characterized by the fact that said second selection means are 
adapted to select the actual transport format of the encoded signal from the last soft output information of each 
selected group. 

30 

9. Convolutional code decoder according to claim 8, characterized by the fact that the actual transport format is the 
reference format having the greatest last soft output information. 



1 0. Convolutional code decoder according to any one of claims 7 to 9, characterized by the fact that said convolutional 
35 code decoding means are adapted to calculate states metrics, by the fact that all the data blocks are decoded in 

parallel window-by-window on sliding windows having a predetermined size, and by the fact that at least some of 
the states metrics calculated on a window are valid for all the data blocks. 



1 1 . Convolutional code decoder according to claim 1 0, characterized by the fact that said convolutional code decoding 
40 means are adapted for each window, to calculate forward state metrics during a forward recursion, to perform a 

backward acquisition having a predetermined acquisition length, to calculate backward state metrics during a 
backward recursion, by the fact that for each window only one forward recursion is performed which is valid for all 
the transport formats. 

45 12. Convolutional code decoder according to claim 11, characterized by the fact that the first window processing 
including forward recursion, backward acquisition, backward recursion and soft output calculation, is completely 
valid for all data blocks having a size larger than the sum of the window size and the acquisition length. 

13. Convolutional code decoder according to any one of claim 7 to 12, characterized by the fact it is a combined 
50 Turbo-code/convolutional code decoder further comprising Turbo-code decoding means (TCDCM) for performing 

turbo-code decoding. 

14. Convolutional code decoder according to claim 13, characterized by the fact that said Turbo-code and convolu- 
tional code decoding means comprising common processing means (CCPR) having a first configuration dedicated 

55 to Turbo-code decoding and a second configuration dedicated to convolutional code decoding, and by the fact that 

said decoder further comprises : 

metrics memory means (TCa-RAM) for storing state metrics associated to the states of a first trellis and de- 
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livered by said processing means in its first configuration, 

input/output memory means (CC I/O RAMs) for storing input and output data delivered to and by said process- 
ing means in its second configuration, 

adaptable memory means (ADMM) for storing input and output data delivered to and by said processing means 
in its first configuration, and for storing state metrics associated to the states of a second trellis and delivered 
by said processing means in its second configuration, 

control means (CTRLM) for configurating said common processing means in its first or second configuration 
depending on the kind of code, and 

memory control means (CTMM) for addressing differently said adaptable memory means depending on the 
configuration of said common processing means. 

15. Convolutional code decoder according to any one of claims 7 to 14, characterized by the fact that said common 
processing means (CCPR) implements a Maximum-A-Posteriori (MAP) algorithm. 

16. Convolutional code decoder according to claim 15, characterized by the fact that the MAP algorithm implemented 
is a so-called LogMAP algorithm or a so-called MaxLogMAP algorithm. 

17. Convolutional code decoder according to any one of claims 7 to 1 6, characterized by the fact it is realized by an 
integrated circuit. 

18. Terminal of a wireless communication system, including a decoder according to any one of claims 7 to 17. 

19. Terminal according to claim 18, characterized by the fact it forms a cellular phone. 

20. Terminal according to claim 18, characterized by the fact it forms a base station. 
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FIG.8 
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